Transforms Common Crawl into a refined long-term pre-training dataset.
Alibaba
$2
Input tokens/M
-
Output tokens/M
256
Context Length
Moonshot
$4
$16
Openai
$0.63
$3.15
131
Chatglm
$8
128
Huawei
4
Tencent
$12
28
$3.5
$10.5
16
Minimax
$1.6
1k
32
Sensetime
$3
$9
Iflytek
8